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DATA  BASE  USERS*  GUIDE  FOR  THE 
CHEMICAL  AGENT  SIMULANT  DATA  CENTER 


1.  INTRODUCTION 

The  increasing  difficulties  associated  with  open  air  testing  with  toxic  chemicals 
requires  the  use  of  less  toxic  simulants  to  minimize  adverse  environmental  impacts. 
Simulants  can  also  reduce  the  hazard  to  laboratory  personnel  and  the  cost  of  waste 
disposal.  Because  any  compound  that  is  not  a  surety  compound  is  a  potential 
simulant,  the  selection  process  for  an  optimum  simulant  can  be  overwhelming. 
Fortunately,  computers  and  computer  data  bases  are  available  or  can  be  established 
to  simplify  the  search. 

The  U.S.  Army  Chemical  Research,  Development  and  Engineering  Center 
(CRDEC),  other  Department  of  Defense  organisations,  and  contractors  engaged  in 
simulant  research  recognized  the  need  for  a  centralized  data  repository.  The 
requirements  are  (1)  quick  access  to  pertinent  data,  (2)  literature  references,  (3) 
identification  of  data  gaps,  (4)  and  the  capability  to  identify  and  resolve  conflicting 
data.  CRDEC,  as  the  lead  organization  in  simulation  of  toxic  chemical  warfare  (CW) 
agents,  established  the  Chemical  Agent  Simulant  Data  Center  (CASDC)  in  response 
to  that  need. 

A  simulant  cannot  exactly  match  all  the  physical/chemical  properties  of  a 
chemical  agent.  The  structural  features  that  determine  toxicity  usually  have  the 
greatest  influence  on  physical/chemical  property  determination.  The  data  base, 
therefore,  has  numerous  organqphosphorus  compounds  for  screening  nerve  agent 
simulants,  sulfides  for  mustard  simulants,  or  other  compounds  of  the  same  chemical 
class  as  the  compound  to  be  simulated.  Tha  CASDC  allows  you  to  starch  for 
structural  features  end  physkal/chamkal  properties,  thereby  permitting  structure 
versus  property  correlation  analysis.  Tha  data  base  should  contain  as  many  potential 
simulant  compounds  as  possible  to  increase  the  probability  of  identifying  the 
fWwiftl  that  most  closely  matrhee  the  properties  of  the  agent.  The  should 

have  a  preponderance  of  chemicals  that  have  available  property  data,  have  bean 
screened  for  toxicity,  and  have  environmental  impact  data  available  to  AtmfaiA  the 
cost  of  providing  the  required  information. 

The  purpoee  of  the  CASDC  is  to  assist  simulant  ueers  in  rhisoaing  and 
employing  simulants  through  the  compilation  of  a  comprehensive  data  base  on 
related  compounds.  It  must  ako  provide  a  quick  and  practical  method  to  aecem  the 
A± Thi  CASDC  mrkm  %  thmlmt  for  4  puticukr  ippltitki 


after  a  rational  evaluation  of  essential  properties;  however,  it  does  not  certify  or 
approve  simulants  for  release  into  the  environment.  It  also  does  not  serve  as  a 
clearing  center  for  simulant  use  permission.  The  final  decision  regarding  simulant 
use  is  the  responsibility  of  the  principal  scientist  overseeing  the  investigation. 

2.  METHODS 

2.1  Literature  Sources 

Through  examination  of  CRDEC  bibliographies  in  CRDEC's  Technical 
Library,  automated  literature  searches  through  the  Defense  Technical  Information 
Center's  data  base,  and  cited  references  in  simulant  related  literature,  a  bibliography 
of  1,000  references  was  compiled.  Sources  include  DOD  services  and  other 
government  agencies,  private  contractors,  foreign  sources,  and  the  open  literature. 
Various  research  and  testing  activities  use  simulants  for  spray,  explosive  and  bulk 
dissemination;  standard  and  experimental  methods  of  decontamination;  different 
methods  of  agent  detection  and  identification;  troop  training  and  trialing  to  include  a 
total  intake  simulant;  collective  protection  studies;  pick-up  and  transfer  studies;  and 
contamination  avoidance.  These  are  general  simulant  application  descriptions  and 
may  require  different  simulants  for  specific  applications  within  each  activity. 

The  initial  search  strategy  was  to  address  a  single  application  at  a  time 
beginning  with  dissemination.  These  simulants  were  the  first  used  and  recently  have 
been  recommended  for  other  applications.  As  a  result  of  numerous  queries  to  the 
CASDC,  it  became  clear  that  all  applications  should  be  examined,  with  emphasis  on 
recency,  so  that  state-of-the-art  in  simulant  research  could  be  reflected.  This 
required  the  determination  of  what  agents,  applications,  and  properties  have  been 
considered  in  simulant  research. 

Simulant  compilations  were  examined  first  so  that  a  listing  could  be  generated 
quickly.  After  initial  identification  and  data  extraction,  various  standard  chemical 
data  sources  were  examined  to  amass  as  much  data  as  possible.  Numerous 
compounds  were  added  to  the  data  base  by  recommendations  from  simulant 
researchers  with  further  data  gleaned  from  standard  chemical  references. 

2J  Datatypes 

To  identify  the  types  of  data  to  be  incorporated  Into  the  data  baas,  a 
preliminary  review  of  simulant  literature  and  interviews  with  key  simulant 
researchers  was  done.  As  a  result  we  decided  to  include  as  many  physical/chemical 
properties  as  possible  to  accommodate  various  simulant  applications  and  to  ensure 
that  unforeseen  future  property  data  needs  are  met.  The  data  base  also  required  the 
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inclusion  of  data  at  different  conditions  for  dependant  variables  and  conflicting  data 
from  different  sources.  Various  simulant  applications  require  the  "***'•*» *"|  of  one  or 
more  properties  judged  essential  for  duplicating  the  agent  in  an  experiment.  The 
properties  list  is,  therefore,  a  summation  of  essential  properties  covering  all  simulant 
applications.  Table  1  lists  the  datatypes  number,  field  name,  field  type,  and  units 
label  in  the  order  in  which  they  appear  in  the  data  base.  New  datatypes  will  be 
added  as  they  are  identified.  An  effort  is  made  to  standardise  unite  of  measure  and 
to  state  dependent  variables  wherever  possible.  Originators  of  other  data  bases 
within  CRD  EC  should  attempt  to  use  common  unite  of  measure  that  will  simplify 
queries  and  data  transfer  among  various  data  bases. 

U  Property  Estimations 

To  fill  some  of  the  data  gaps  in  the  data  base,  property  estimation  routines  are 
employed.  These  estimates  are  used  until  other  sources  for  the  data  have  been 
encountered.  The  routines  are  derived  from  open  literature,  computer  programs,  or 
developed  by  personnel  in  the  Chemometrics  and  Biometrics  Modeling  Branch 
(CBM),  Physics  Division,  Research  Directorate,  CRDEC.  Estimated  data  is 
appropriately  noted  with  an  insertion  of  "set*  in  the  comments  field.  This  type  of 
data  can  also  be  discerned  because  all  data  estimations  or  calculations  generated  by 
CBM  personnel  use  reference  citation  54.  Diminishing  the  number  of  data  gape 
helps  increase  the  probability  that  the  most  appropriate  candidate  simulant  will  be 
identified.  The  CASDC  usee  estimation  routines  for  the  following  properties: 


vapor  praeeure 
surface  tension 
vapor  density 

of  viporisilion 
bast  capacity 
water  solubility 
refractive  index 
molar  volume 


volatility 

density 

point 

heat  of  formation 
viscosity 

octanol/watar  partition  coaflkient 
Hildebrand!  solubility  parameter 
molar  refraction 


2.4  Data  Bast  Management  System 

The  CASDC  was  initially  located  on  the  CRDEC%  Unhrac  1106  computer 
using  CREAT-A-BASE  (Lev*)  5E)  management  software  (NDX  Corporation, 


TABLE  1.  INDEX  OF 


Datatype 

Field 

Number 

Name 

Fixed 

Datatypes 

1 

CRD  EC  Number 

2 

Date  Entered 

3 

Molecular  Weight 

4 

CAS  Registry  Number 

5 

Wiswesser  Line-Notation 

6 

Military  Designation 

Flexible 

Datatypes 

7 

Synonyms 

8 

Vapor  Pressure 

9 

Density 

10 

Vapor  Density 

11 

Boiling  Point 

12 

Melting  Point 

13 

Molar  Volume 

14 

Surface  Tension 

15 

Viscosity 

16 

Vapor  Viscosity 

17 

Volatility 

18 

Diffusion  Coefficient 

19 

Refractive  Index 

20 

Molar  Refraction 

21 

Decomposition  Temperature 

22 

Flash  Point 

23 

Autoignition  Temperature 

i  t 
i  ■ 

DATATYPES 


Field  i  Units 

Type ;  Label 

j 


CRDEC-00000 

MM/DD/YY 

Numeric 

00000-00-0 

Text 

Text 


Text 

Numeric 

Torr 

Numeric 

g/cm3 

Numeric 

Air=l 

Numeric 

°C 

Numeric 

°C 

Numeric 

cms/mole 

Numeric 

dynes/cm 

Numeric 

centipoise 

Numeric 

micropoise 

Numeric 

mg/m3 

Numeric 

cm5/s 

Numeric 

sodium  D  line 

Numeric 

cm3/gm  mole 

Numeric 

°C 

Numeric 

°C 

Numeric 

°C 
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24 

Oxygen  Index 

I1 

f 

Numeric 

% 

25 

Heat  of  Vaporization 

Numeric 

cal/gm 

2G 

Heat  of  Combustion 

Numeric 

cal/gm 

27 

Heat  of  Fusion 

Numeric 

cal/gm 

28 

Heat  of  Formation 

Numeric 

cal/gm 

29 

Energy  of  Vaporization 

Numeric 

cal/gm 

30 

Heat  Capacity 

Numeric 

cal/gm  °C 

31 

Specific  Heat 

Numeric 

cal/gm  °C 

32 

Hydrolysis  Rate 

Text 

33 

Oct/Water  Partition  Coefficient 

Numeric 

Log  P 

34 

Hildebrandt  Solubility  Parameter 

Numeric 

(cal/cm3)*^ 

35 

Hygroscopic  ity 

Text 

36 

Water  Solubility 

Text 

37 

Water  Solubility  Value 

Numeric 

gm/100  mL 

38 

Critical  Temperature 

Numeric 

°C 

39 

Critical  Pressure 

Numeric 

Atmospheres 

40 

Critical  Volume 

Numeric 

cms/mo!e 

41 

Critical  Density 

Numeric 

grr./cm5 

42 

Dielectric  Constant 

Numeric 

Dimensionless 

43 

Dipole  Moment 

Numeric 

debyes 

44 

Toxicity 

Text 

45 

Chemical  Reactivity 

Text 

46 

Simulant  Application 

Text 

47 

Industrial  Application 

Text 

48 

Comments 

Text 

Houston,  TX).  The  decision  to  make  the  CASDC  an  on-line  accessible  data  base 
required  finding  a  more  functional  and  user-friendly  data  base  management  system. 
This  was  done  by  creating  the  Data  Management  Office  (DMO),  after  much 
discussion  and  surveys  of  CRDEC’s  data  base  requirements,  and  by  purchasing  the 
Molecular  Access  System  (MACCS)  (Version  5.14,  marketed  by  Molecular  Design, 
Ltd.)  data  base  management  system  and  the  VAX  Rdb/VMS  Version  2.1  (Digital 
Equipment  Corporation,  Maynard,  MA)  for  installation  on  a  dedicated  VAX  780. 
MACCS  is  a  user-friendly,  menu-driven,  chemical-  oriented  system  that  should  fulfill 
CRDEC’s  chemical  data  basing  needs.  It  meets  all  the  requirements  for  the  CASDC 
and  offers  additional  capability  in  structure  storage  and  structure/substructure 
retrieval.  Each  compound  entered  into  the  data  base  must  have  its  structure  defined, 
or  no  structure  available  specified,  before  it  can  be  registered.  Structure  storage  is 
required  to  remove  any  doubt  about  a  compound’s  identity.  This  allows  anyone 
wishing  to  query  for  complete  structures  or  substructure  fragments  to  be  able  to 
exactly  match  the  structural  features  they  define. 

J.  ACCESSIBILITY 
U  Eligibility  and  Passwords 

The  CASDC  is  available  to  CRDEC,  DOD  services,  other  government  agencies, 
and  their  contractors  having  a  requirement  for  simulant  related  data.  Individuals  not 
having  a  need  for  frequent  access  can  obtain  information,  have  their  queries 
processed,  and  have  printed  data  mailed  by  contacting  Phillip  Coon,  the  CASDC 
data  base  manager,  or  his  alternate  Georgs  Faxnini  at  the  following  addrem: 

Commander 

U.S.  Army  Chemical  Research,  Development 
and  Engineering  Center 
ATTN:  SMCCR-RSP-C 

Aberdeen  Proving  Ground,  MD  21010-5423 
Commercial  telephone  inquiries  can  dial  (301)671-3243/2775,  whereas  AUTOVON 
network  can  use  564-3248/2775.  Those  requiring  frequent  use  and  desiring  a 
password  must  contact  the  DMO  at  the  following  address: 

Commander 

U.S.  Army  Chemical  Research,  Development 
and  Engineering  Center 

ATTN:  SMCCR-TDT/Steve  Lawhome 

Aberdeen  Proving  Ground,  MD  21010-5423  Commercial  telephone  inquires 
to  the  DMO  can  dial  (301)671-2035  or  AUTOVON  users  dial  584-2038. 


3.2  Data  Base  Access 

The  DMO  computer  can  be  accessed  by  selecting  2-DMO  from  the  CRDEC 
workplace  automation  PACXNET  of  available  systems.  The  remote  long  distance 
number  into  the  CRDEC  PACXNET  is  1-800-826-3460.  The  DMO  computer  can 
be  accessed  directly  from  locations  having  TELNET  service.  Typing  (open  crdec- 
dmo.apgea.army.mila)  at  the  TELNET  prompt  provides  this  direct  access.  As 
previously  mentioned,  a  valid  user  id  and  password  from  the  DMO  is  required. 
Typing  ‘'menu'  at  the  dmo  prompt  provides  the  DMO  menu  of  data  bases  (Figure  1). 
Selecting  option  2,  'Simulants,'  accesses  the  CaSDC  menu  of  options  (Figure  2).  Use 
this  menu  to  conduct  searches,  generate  data  reports  for  compounds  of  choice,  or  to 
exit  to  the  DMO  menu. 

M  Conducting  Searches 

Different  approaches  can  be  used  in  executing  queries  within  the  MACCS 
system.  A  query  can  use  different  options  in  search  of  specific  data.  The  reasons  for 
selecting  certain  query  paths  depend  on  your  wish  to  identify  certain  compounds  or 
to  retrieve  data  points.  The  following  search  strategy  is  an  abridgement  of  the 
available  options  and  represents  a  simple  approach  to  data  retrieval.  MACCS  offers 
38  pages  under  '(H)ELP'  to  assist  in  executing  all  system  options. 

After  selecting  option  1,  in  order  to  conduct  a  data  search,  the  user  is 
presented  with  the  EXECUTIVE  MODE  menu  of  MACCS  (Figure  3).  Input  can  be 
either  'graphics'  (default)  using  a  cursor  control  device  or  '(k)ey board'  when  in 
EXECUTIVE  MODE.  The  option  modes  for  conducting  data  searches  are 
"(S)EARCH,“  '(D)ATA,’  and  '(F)IND."  Query  results  in  8EARCH  MODE  (Figure 
4)  display  internal  and  external  registry  numbers  and  compound  name.  A  listing  is 
also  created  that  can  be  used  es  the  reference  list  for  further  data  searches.  The  lists 
are  also  used  in  data  manipulation  and  transfer.  Query  resnhe  in  DATA  MODE 
(Figure  6)  display  entry  numbers,  datatype  name  and  number,  and  any 
accompanying  data  without  identifying  the  compound  by  tm*.  No  further 
treatment  of  the  output  can  be  conducted  as  no  lists  are  created.  Although  data 
starches  can  be  conducted  in  DATA  MODE,  its  primary  function  Is  data  entry, 
deletion,  manipulation,  and  transfer.  DATA  MODE  is  also  used  to  view  individual 
data  points  for  compounds  identified  in  SEARCH  MODE.  To  display  a  Hating  of 
datatypes,  acquire  datatype  entry  numbers,  determine  unite  of  measure  or  to  preview 
comments,  select  '(D) ATA'  and  then  *(J)ndex.'  After  the  'datatypes* *  prompt, 
specify  the  datatype  number (s),  a  range  of  numbers,  or  (all)  for  a  complete  listing. 
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3.3.1  Name  Searches 

Chemical  name*  can  be  searched  in  two  ways.  The  first  uses  the  '(F)  IND 
(N)AME'  option  in  EXECUTIVE  MODE  or  '(N)ame'  option  in  SEARCH  MODE. 
This  queries  the  TUPAC  convention  name  that  is  entered  at  the  time  a  structure  is 
registered.  Name  fragment  searches  use  the  O  sign  before  and/or  after  a  name 
fragment  as  a  universal  substitute.  A  second  method  can  be  used  if  no  compounds 
are  selected  using  NAME  options.  This  method  searches  '(d)atatype  7,'  Synonym* 
in  '(S)EARCH'  or  '(D)ATA'  MODE.  Synonyms  include  all  other  names  and 
identifiers.  No  0  sign  is  required  for  synonyms.  Synonyms  are  a  textual  search 
descriptor  that  recognises  any  character  string.  Typing  '(e)xit' when  in  SEARCH  or 
DATA  MODE  will  return  the  user  to  EXECUTIVE  MODE. 

UJ  Data  Searches 

Data  searches  for  numeric  or  text  data  an  best  conducted  in  "(S)EARCH' 
MODE,  using  the  '(D)ata'  option.  The  user  will  be  able  to  identify  compounds  by 
name  and  to  generate  lists.  Specific  data  values  can  then  be  examined  in  '(D)ATA' 
MODE  or  by  using  '(F)IND  (D)ATA.'  A  data  search  in  *(D)ATA*  MODE  using  the 
'(S)earch'  option  will  not  identify  a  compound  by  name.  Figure  5  shows  data 
retrieved  using  this  option.  Data  searches  are  always  dooe  on  the  Reference  List 
unless  it  is  sero;  in  which  case  the  search  is  dons  on  all  compounds  in  the  data  base. 
To  sero  lists,  select  *(Z)ero  (R)eference*  or  *(A)cUvt*  when  in  SEARCH  MODE  or 
‘(Z)ero'  when  in  DATA  MODE.  Numeric  searches  are  made  by  identifying  the 
datatype  number  at  the  prompt  then  opacifying  a  from/to  range  separated  with  a 
space  or  dash.  Dependent  vari&blas  or  any  other  item  in  the  comment  field  can  be 
specified  by  using  a  slash  and  the  desired  character  string  immediately  after  the 
upper  bound.  An  example  of  a  query  tangs  for  normal  boiling  point  would  be  *169 
170/0760.*  Figures  4  and  5  chow  the  contrasting  results  of  this  query  in  SEARCH 
MODE  and  DATA  MODE  respectively.  Sometimes  it  h  advisable  not  to  specify 
dependent  variables,  as  some  data  have  dependent  variables  unstated.  Dependent 
variables  and  comments  can  be  scrutinised  after  a  Hating  is  generated 

Searching  of  multiple  datatypes  with  a  single  query  cannot  be  executed. 
Searc  Ung  for  more  than  one  property  must  be  done  sequentially  in  SEARCH 
MOT  E.  After  the  initial  property  search,  the  resulting  fisting,  or  active  fist,  becomes 
the  reference  fist  by  selecting  *(R)ead  (R)eference*  and  than  typing  'act*  This  is 
done  each  time  the  active  fist  is  to  become  the  reference  fist  The  rsfsranco  fist 
remains  uuti!  H  is  earned.  Table  2  is  an  example  of  a  muhipis  property  search  for  a 
mustard  simulant  showing  properties,  range,  available  compounds,  and  number  of 
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qualifying  compounds.  All  properties  have  a  range  of  ±10%  except,  vapor  pressure. 
The  table  demonstrates  how  the  initial  319  compounds  are  diminished  each  time  a 
new  property  is  queried.  The  final  selection  was  methyl  salicylate.  Table  3  is  a 
comparison  of  the  six  selected  property  values  for  mustard  and  methyl  salicylate. 

3.4  Generating  Reports 

The  CASDC  provides  a  means  of  generating  reports  of  all  compiled  data  for 
compounds  of  interest  resulting  from  a  data  search.  The  report  format  was  created 
using  DATACCS,  which  is  external  to  MACCS  and  located  on  the  DMO  VAX  780. 
A  five-page  report  was  required  to  accommodate  all  the  datatypes.  After  completing 
a  data  search  the  user  exits  MACCS  by  typing  or  selecting  the  '(E)XIT'  option  while 
in  EXECUTIVE  MODE  and  is  returned  to  the  CASDC  menu  (Figure  2).  With  a  list 
of  entry  numbers  for  compounds  of  interest  in  hand,  the  user  selects  option  2  and 
states  the  type  of  terminal  in  use  as  requested.  At  this  point,  the  user  is  requested  to 
enter  the  entry  number  for  the  compound  of  interest.  A  <RET>  is  used  to  generate 
each  page  of  the  report.  Type  ”(e)xit”  after  the  last  page  to  return  to  CASDC  menu 
This  procedure  is  repeated  for  each  compound  for  which  a  report  is  desired.  Figures 
6-10  are  examples  of  a  complete  report  of  the  mustard  simulant  methyl  salicylate. 
The  user  ends  the  session  by  selecting  the  'quit'  option  of  the  CASDC  menu,  the 
"quit”  option  of  the  DMO  menu,  and  logs  off  with  a  'io"  at  the  dmo  prompt. 

4.  SUMMARY  AND  DISCUSSION 

Personnel  at  CRDEC  (CBM  Branch)  established  the  CASDC.  These  personnel 
are  responsible  for  data  base  maintenance  and  frequently  add  new  compounds,  new 
data  points,  or  define  new  datatypes.  The  CASDC  is  an  on-line,  user-friendly, 
computerized  data  base  of  compounds  that  have  been  used,  recommended,  screened, 
or  otherwise  considered  as  chemical  agent  simulants.  The  data  base  is  available  to 
CRDEC,  other  DOD  services,  other  government  agencies,  and  contractors  engaged 
in  simulant  research.  Currently,  there  ^re  600  compounds  entered  with  50  different 
types  of  data  that  can  be  requested.  Available  to  the  user  is  a  report  generator  that 
provides  a  copy  of  all  compiled  data  for  compounds  of  interest.  New  compounds, 
data,  and  datatypes  are  added  as  they  are  encountered  or  as  requests  are  received  for 
inclusion. 

Along  with  identifying  and  entering  new  data,  the  data  base  compilers  are 
developing  new  descriptors  to  better  define  simulant  applications  and  selection 
criteria.  The  data  base  managers  expect  that  requests  for  simulant  use,  application, 
selection  criteria,  agent/simulant  correlation  or  other  similar  data  will  generate  a 
report  similar  to  Figure  11. 
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Fleur*  11.  SIMULANT  USE  REPORT 


Personnel  responsible  for  the  CASDC  solicit  all  individuals  having  comments 
for  improving  the  data  base  to  offer  their  recommendations.  This  is  the  principle 
way  that  the  CASDC  can  be  updated.  Your  imput  will  ensure  that  the  data  base 
reflects  state-of-the-art  in  Simulant  research  and  that;  all  pertinent  compounds  are 
entered  into  the  data  base. 
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APPENDIX 
DATA  DICTIONARY 


•  Name:  an  unassigned  fixed  datatype  established  at  the  time  a  compound  is 
registered.  To  assure  continuity,  an  effort  is  made  to  follow  rules  of 
nomenclature  as  established  by  The  Council  of  the  International  Union  of  Pure 
and  Applied  Chemistry  (IUPAC). 1  A  name  search  is  done  under  NAME  option 
of  SEARCH  mode  or  FIND  NAME  option  of  EXECUTIVE  mode.  If  not 
found,  a  search  should  be  made  of  synonyms  under  SEARCH  or  DATA  mode. 


•  Formula:  empirical  formula  of  compounds  following  the  Hill  convention.  It  is 
an  unassigned  fixed  datatype.  It  is  system  generated  at  time  of  registration  and 
is  accessed  under  FORMULA  option  of  SEARCH  mode. 

1.  CRDEC  Number  (External  Registry  Number):  externally  assigned  fixed 
datatype  that  will  identify  the  compound  throughout  all  CRDEC  data  bases. 
Some  coordination  is  required  between  Data  Management  Office  and  data  base 
generators  to  ensure  the  singularity  of  this  identifier.  All  data  bases  on  the 
system  should  be  queried  for  this  compound  to  determine  if  a  CRDEC  number 
has  been  assigned  or  needs  to  be  assigned. 


2.  Date  Entered:  a  system  generated  fixed  datatype  that  represents  the  date  the 
compound  was  initially  registered.  It  uses  American  format  of  MM/DD/YY. 


3.  Molecular  Weight:  a  fixed  datatype  that  is  the  sum  of  atomic  weights  of  all 
atoms  in  the  molecule  based  on  Carbon  12.  It  is  system  generated,  and  units 
are  understood  as  grams  per  mole  (g/mol). 


4.  CAS  Registry  Number:  a  singular  identifier  assigned  by  Chemical  Abstract 
Service  and  represents  the  most  universal  code  for  compound  identification 
used  throughout  the  chemical  industry.  CAS  was  established  in  1065,  and 
over  4  million  substances  have  been  registered.  It  is  a  formatted  field  using  the 
form  00000-00-0  with  leading  zeros  omitted. 
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5.  Wiswesser  Line-Notation  (WLN):  this  is  an  important  indexing  tool  that 
establishes  the  topology  and  make-up  of  a  molecule  using  an  unambiguous 
code.  ^  Many  compilations  are  indexed  using  WLN,  and  some  computerized 
data  bases  access  or  generate  structures  using  this  line  code.  It  is  a  text  field 
and  can  be  used  for  accessing  certain  compounds  or  querying  for  various 
structural  features. 


6.  Military  Designation:  this  descriptor  was  designed  to  reserve  a  fixed  datatype 
for  future  use  and  contains  no  data. 


7.  Synonyms:  a  listing  of  other  chemical  names,  trade  names,  code  names, 
abbreviations,  acronyms,  etc.  If  a  compound  is  not  found  under  the  NAME 
option  of  SEARCH  or  FIND  mode,  it  can  be  searched  under  this  datatype 
using  a  full  name  or  name  character  string. 


8.  Vapor  Pressure:  the  pressure  exerted  by  a  substance  in  equilibrium  with  its 
own  vapor.  ^  It  is  temperature  dependent  and  increases  with  an  increase  in 
temperature  and  whose  standard  unit  is  millimeters  of  mercury  (mmHg),  also 
called  Torr.  CRDEC  has  compiled  estimation  routines  using  various 
procedures.  These  data  are  used  until  experimental  data  are  encountered. 


9.  Density:  the  mass  of  a  substance  that  is  temperature  dependent  and  whose 
units  are  grams  per  cubic  centimeter.  It  is  understood  to  be  equivalent  to 
specific  gravity  (ratio  of  equal  mass  to  water  at  4  *C)  when  a  single  temperature 
is  cited,  otherwise  the  comparing  temperature  of  water  is  the  second 
temperature  cited  following  the  slash. 


10.  Vapor  Density:  the  ratio  of  equal  volumes  of  vapor  and  dry  air  under  identical 
conditions  of  temperature  and  pressure.  It  can  be  calculated  as  the  molecular 
weight  of  the  chemical  divided  by  that  of  dry  air  that  has  been  approximated 
to  be  29.**  The  resuit  [s  the  ratio  of  chemical  vapor  to  that  of  air  and,  therefore, 
is  unitless. 


11.  Boiling  Point:  the  temperature  at  which  the  vapor  pressure  of  a  chemical  is  in 
equilibrium  with  the  surrounding  system  and  is  pressure  dependent.  All 
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boiling  point*  have  their  requisite  pressures  stated.  Normal  boiling  points 
within  a  range  can  be  searched  by  typing  /760  immediately  after  the  upper 
bound.  Units  are  in  degrees  Celsius. 


12.  Melting  Point:  the  temperature  at  which  a  substance  changes  from  a  solid  to  a 
liquid.  It  is  generally  equivalent  to  the  freezing  point  or  is  specified  as  a  range. 
This  datatype  can  be  used  to  ascertain  if  a  chemical  is  liquid  or  solid.  Units 
are  in  degrees  Celsius. 

13.  Molar  Volume:  volume  occupied  by  1  mole  and  is  equal  to  the  molecular 
weight  divided  by  the  density. 1  Units  are  cubic  centimeters  per  mole. 

14.  Surface  Tension:  the  force  exerted  on  the  plane  of  the  surface  per  unit  length 
that  opposes  spreading  and  tends  to  shape  liquids  into  drops.*  Values  are 
expressed  in  dynes  per  centimeter  and  decrease  with  an  increase  in 
temperature.  In  the  absence  of  literature  values,  the  estimation  method  of 
MacLeod  and  Sugden  is  used. 

15.  Liquid  Viscosity:  the  measure  of  liquid  resistance  to  a  change  of  form  or  a 
form  of  internal  friction. 1  It  is  expressed  in  dyne-aecond*  per  square 
centimeters  or  poise.  Kinematic  viscosity  is  the  ratio  of  viscosity  to  density 
and  whose  unit  is  the  stoke.  Stokes  are  converted  to  poise  by  multiplying  by 
density  at  the  given  temperature.  Viscosity  estimation  is  done  using  Chemast;  a 
computer  package  based  on  the  Handbook  of  Chemical  Property  Estimation 
Methods  using  the  method  of  Orrick  and  Erbar. 

16.  Vapor  Viscosity:  same  as  liquid  viscosity  with  unit  expressed  in  micropoise. 

17.  Volatility:  the  mass  of  saturated  vapor  concentration  measured  in  milligrams 
per  cubic  meter.  It  is  also  known  as  the  saturated  vapor  density  or 
concentration.  It  can  be  calculated  from  vapor  pressure  (VP)  and  molecular 
weight  (MW)  data  with  temperature  (T)  in  Kelvin  degrees  using  the  equation: 4 

.  VPxMWx  16020 

Volatility  - - - -  (1) 
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18.  Diffusion  Coefficient:  a  transport  property  that  measures  the  flux  acmes  a 
plain  of  binary  g—  concentration  gradients.^  It  is  the  diffusion  of  chemical 
vapor  in  air  measured  in  square  centimeters  per  second. 


10.  Refractive  Index:  the  ratio  of  the  velocity  of  light  in  a  vacuum  to  Ha  velocity 
in  the  substance.  *  The  light  source  is  the  sodium  D  fine  and  is  a  dimensionless 
parameter  ranging  from  1.3  to  13  for  organic  liquids.  Also  called  index  of 
refraction,  absolute  index  of  refraction,  absolute  refractive  index,  or 
refractivity.  It  is  a  function  of  wavelength,  temperature,  and  pressure. 


30.  Molar  Refraction:  a  parameter  indicating  the  way  that  a  molecule  interacts 
with  light.  It  is  calculated  from  the  equation: 7 


n»-l  1  MW 
»*+2  d 


(2) 


n  *  refractive  index 

MW  *  molecular  weight 

d  *  density 


31.  Decomposition  Temperature:  the  temperature  at  which  a  chemical  decomposes 
or  separates  Into  two  or  more  different  substances.  Units  are  degrees  CaMus. 


23.  Flash  Point:  the  temperature  that  a  volatile  combustible  substance  will  ignite 
when  exposed  to  a  flame.  Two  methods  commonly  used  are  the  Tag  closed  cup 
(cc)  or  the  Cleveland  open  cup  (oc)  which  is  usually  slightly  higher  then  the 
closed  cup  method.  *  Units  are  degrees  Cekius. 


33.  Autoignition  Temperature:  the  temperature  at  which  a  substance  will  ignite 
without  an  ignition  source  such  as  flame  or  spark  and  measured  fat  digress 
Celsius. 
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24.  Oxygen  Index:  the  percentage  of  oxygen  in  nitrogen  that  will  cauae  a  chemical  j 
to  sustain  a  candle  type  flame  when  the  gae  is  passed  up  and  around  the 
chemical 9  It  is  used  as  a  flammability  or  flash  indicator. 

I 

t 

25.  Heat  of  Vaporization:  the  quantity  of  heat  necessary  to  change  ooe  gram  of  i 
liquid  to  vapor  without  a  change  of  temperature*,  measured  in  calories  per  ; 
gram.  It  varies  with  temperature  and  is  normally  stated  at  the  boiling  point 
with  slightly  higher  values  at  ambient  temperatures.  Also  called  the  latent  heat 
of  vaporisation,  heat  of  evaporation,  or  enthalpy  of  vaporisation.  CRDEC 
employs  various  estimation  methods  for  this  property  depending  on  the 
avail'  Miity  of  other  data. 


26.  Heat  of  Combustion:  the  amount  of  heat  generated  by  the  combustion  of  one 
gram  mole  weight  of  the  substance  measured  in  calories  per  gram.1 


27.  Heat  of  Fusion:  the  quantity  of  heat  necessary  to  change  one  gram  of  solid  to  a 
liquid  with  no  temperature  change  measured  in  calories  per  gram.*  Also  termed 
latent  heat  of  fusion,  enthalpy  of  fusion,  or  enthalpy  of  melting. 

28.  Heat  of  Formation:  the  quantity  of  heat  involved  in  the  formation  of  1  mol  of 
substance  from  its  elements  in  their  standard  states,  measured  in  calories  per 
gram.*  A  negative  sign  preceding  the  value  signifies  hast  is  evolved  and  no 
sign  or  positive  sign  shows  that  heat  is  absorbed. 

28.  Energy  of  Vaporisation:  equivalent  to  heat  of  vaporisation  minus  any  wort 
dona,  measured  in  calories  per  gram.  Can  be  calculated  as  the  heat  of 
vaporisation  in  calories  per  gram  mole  minus  the  gas  constant  (1.967  cal/mol 
degree)  times  the  temperature  in  Kelvin. 

SO.  Heat  Capacity:  the  quantity  of  heat  required  to  increase  the  temperature  of  a 
substance  one  degree.7  It  is  the  ratio  of  absorbed  heat  to  the  resulting 
temperature  increase.  Units  of  measure  are  calories  per  degree  Celsius. 
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31.  Specific  Heat:  the  term  historically  meant  the  ratio  of  heat  capacity  of  a 
substance  to  that  of  an  equal  mass  of  water.  Recently,  it  meant  the  ratio  of 
heat  capacity  to  the  mass  of  a  substance  and  could  more  appropriately  be  called 
the  specific  thermal  capacity  .  When  measured  in  calories  per  gram  degree 
Celsius  it  is  equivalent  to  the  historic  older  definition.  Specific  heats  should  be 
specified  as  being  measured  under  constant  conditions  of  volume  and  pressure. 


32.  Hydrolysis  Rate:  the  decomposition  of  a  compound  by  reaction  with  water.  A 
text  datatype  to  accommodate  various  methods  of  measurement  that  are 
temperature,  time,  and  pH  dependent.  Because  of  various  reporting 
conventions,  a  search  of  other  related  solubility  properties  should  be  done 
when  trying  to  match  agent/simulant  hydrolysis  rates.  After  a  compound  has 
been  identified,  determine  if  hydrolysis  data  is  available.  It  is  planned  to 
include  other  parameters  describing  solvent/solute  interactions. 

33.  Octanol/Water  Partition  Coefficient:  the  ratio  of  chemical  concentration  in  the 
QflflfuO  phi—  to  its  concentration  in  the  aqueous  phase  of  a  two  phase 
octanol/water  system.  Values  are  unitless,  may  be  positive  or  negative,  and 
range  from  *3  to  7.*  Partition  coefficient  are  useful  in  modeling  chemical 
transport  through  biological  systems  and  in  studies  of  environmental  fate. 
CRD  EC  has  computerised  estimation  routines  for  this  property. 


34.  HUdebrandt  Solubility  Parameter  a  quantitative  measure  of  solubility  for 
nonpolar  organic  compounds  calculated  as  the  square  root  of  the  cohesive 
energy  density,  and  defined  as  solubility  parameter  (H).  It  can  be  calculated 
using  data  with  the  following  equation: 


6m  ■ 


d(LH,  -  KT) 


HW 


(3) 


Appendix 


d  -  density  (g/cm) 

AHv  a  heat  of  vaporisation  (cal/g  mole) 

R  m  gas  constant  (1.067) 

T  =  temperature  (K  of  density) 

MW  s  molecular  weight 
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Some  consideration  most  be  given  more  polar  or  hydrogen-bonded  solvents 
because  of  the  presents  of  stronger  intermolecular  forces. 

3$.  Hygroscopkity:  the  ability  to  become  wet  by  absorption  of  water  from  the 
atmosphere.  Data  is  generally  qualitative  expressed  as  yes  or  no  with 
quantitative  data  compiled  when  available. 

36.  Water  Solubility:  this  text  datatype  is  for  qualitative  data  of  unsoluble,  slight, 
soluble,  and  miscible  categories. 

37.  Water  Solubility  Value:  a  quantitative  measure  of  water  solubility  expressed  in 
grams  per  100  mL.  The  highly  dependent  temperature  variable  is  included 
when  available. 

38.  Critical  Temperature:  the  temperature  above  when  a  substance  cannot  exist  in 
the  liquid  phase,  regardless  of  the  pressure.  *  Units  an  degress  Celsius. 

30.  Critical  Pressure:  the  pressure  required  to  liquefy  a  gas  at  the  critical 
temperature  *.  Units  are  atmospheres  (atm). 

40.  Critical  Volume:  the  volume  occupied  by  one  gram  of  fiquid  or  gas  at  its 
critical  temperature  and  critical  pressure*.  Units  of  measure  are  in  cubic 
centimeters  per  mole. 

41.  Critical  Density:  the  density  of  a  substance  at  its  critical  temperature  and 
critical  pressure  *  as  measured  in  grams  per  cubic  centimeter. 

43.  Dielectric  Constant:  the  ratio  of  the  capacity  of  a  condenser  urHh 
substance  as  dielectric  to  the  capacity  of  the  same  condenser  urHh  a  vacuum 
dielectric.  1  It  is  a  measure  of  the  amount  of  electrical  charge  a  substance 
withstand  at  a  given  electric  field  strength.  Thh  property  Is  temperature 
fptqiHPry  dtpt&dt&t  md  units  in  dintnsioitlist* 


Appendix 


-31- 


5.8  rf 


43.  Dipole  Moment:  the  summary  or  permanent  dipole  moment  is  dependent  on 
the  magnitude  of  the  charge  and  distance  separating  positive  and  negative 
charges.  It  is  the  summary  of  both  electrons  and  nuclei  charges  in  the  molecule. 
Numerical  values  are  expressed  in  debyes,  which  are  equivalent  to  10*18 
electrostatic  units  per  centimeter  (asu-cm) 5.  Dipole  moments  of  organic 
molecules  normally  fall  between  0  and  5  debyes. 


44.  Toxicity:  a  listing  of  various  values  to  indicate  relative  toxicity.  This  topic  is  a 
data  base  unto  itself,  and  the  requirement  for  detailed  toxicity  data  can  be 
better  served  through  access  to  existing  toxicity  data  bases. 

45.  Chemical  Reactivity:  a  text  descriptor  originally  designed  far  materials 
compatibility  data.  It  was  recently  expanded  to  include  all  types  of  reactivity 
data  with  other  chemical  compounds,  groups,  and  classes.  Little  data  has  been 
entered  on  this  subject  because  it  requires  searching  through  other  than 
simulant  related-literature. 


46.  Simulant  Application:  text  data  designed  to  describe  the  relevance  of  the 
compound  to  simulant  research.  Information  on  this  topic  is  currently  VnfatA 
to  simulated  agent  and  application  with  many  compounds  having  no  data.  It  is 
planned  to  expand  this  with  additional  datatypes  for  agent  simulated;  type  of 
research  activity;  selection  criteria;  shnuiant/agaat  corratation;  research 
methods;  efficacy;  and  statue  to  describe  If  the  simulant  was  used, 
recommended,  screened,  or  not  recommended.  Each  compound  in  the  data 
beet  should  have  at  least  one  entry  far  this  descriptor. 

47.  Industrial  Application:  a  description  of  the  application  and  prevalence  of  the 
chemical  in  the  private  sector.  This  wiU  assist  simulant  uaars  in  determining 
data  sources  end  likelihood  of  acquiring  use  approvals. 

48.  CommtnU:  tho  locitioft  lot  jiti  not  aptciftcityy  rotolod  to  tho  doiotypti 
ortvioottv  tkatribtd. 
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